Imitation Learning in The Game of Go with Joseki Options

نویسندگان

William Dabney

Amy McGovern

چکیده

Scaling reinforcement learning methods to large, challenging decision making tasks can potentially benefit from integrating domain specific knowledge in a principled manner. This synthesis focuses on applying two forms of domain knowledge about the game of Go to improve learning performance on what continues to be an extremely challenging task. First, learning is bootstrapped by using reinforcement learning to learn to imitate expert Go players. This utilizes databases of expert game records as a source of training experiences. Second, reusable options are automatically created from a joseki database, and reinforcement learning is used to learn when to apply these joseki during a game. Together these improve the performance of a reinforcement learning agent against a much better computer Go player.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Design and Implementation of a Heuristic beginning Game System for Computer Go

There are roughly three stages in a Go game: the beginning game, the middle game, and the end game. This paper describes a computer Go beginning game system which includes occupying corners, joseki, extending edges, and dealing with Moyo. This beginning game system has been used in a computer Go program named Jimmy 4.0. Having been tested by professional Go players, this system is estimated at ...

متن کامل

Active Opening Book Application for Monte-Carlo Tree Search in 19×19 Go

The dominant approach for programs playing the Asian board game of Go is nowadays Monte-Carlo Tree Search (MCTS). However, MCTS does not perform well in the opening phase of the game, as the branching factor is high and consequences of moves can be far delayed. Human knowledge about Go openings is typically captured in joseki, local sequences of moves that are considered optimal for both player...

متن کامل

Applying Data Mining to the Study of Joseki

Go is a strategic two player boardgame. Many studies have been done with regard to go in general, and to joseki, localized exchanges of stones that are considered fair for both players. We give an algorithm that finds and catalogues as many joseki as it can, as well as the global circumstances under which they are likely to be played, by analyzing a large number of professional go games. The me...

متن کامل

Combination of real options and game-theoretic approach in investment analysis

Investments in technology create a large amount of capital investments by major companies. Assessing such investment projects is identified as critical to the efficient assignment of resources. Viewing investment projects as real options, this paper expands a method for assessing technology investment decisions in the linkage existence of uncertainty and competition. It combines the game-theore...

متن کامل

An Adaptive Learning Game for Autistic Children using Reinforcement Learning and Fuzzy Logic

This paper, presents an adapted serious game for rating social ability in children with autism spectrum disorder (ASD). The required measurements are obtained by challenges of the proposed serious game. The proposed serious game uses reinforcement learning concepts for being adaptive. It is based on fuzzy logic to evaluate the social ability level of the children with ASD. The game adapts itsel...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2010

Imitation Learning in The Game of Go with Joseki Options

نویسندگان

چکیده

منابع مشابه

Design and Implementation of a Heuristic beginning Game System for Computer Go

Active Opening Book Application for Monte-Carlo Tree Search in 19×19 Go

Applying Data Mining to the Study of Joseki

Combination of real options and game-theoretic approach in investment analysis

An Adaptive Learning Game for Autistic Children using Reinforcement Learning and Fuzzy Logic

عنوان ژورنال:

اشتراک گذاری